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DETAILED ACTION 
Claim Rejections - 35 USC § 102 

1. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351 (a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

2. Claims 1-2, 5-9, 14, 15-16, 19-23, 29-32 are rejected under 35 U.S.C. 102(e) as 
being anticipated by Thelen et al. (6,487,534 filed 3/23/00) 



The table below summarizes the limitations of this application and the 
corresponding parts of Thelen et al. which read on these limitations. 



!:Clairn|| 


in! lulls' ■■■■^ : ■l]imitations;;f l-ll ; ^ M,.--\- Jl 




1, 15 


A speech recognition system in which a client and a 






device that provides a speech recognition process are 


Client + plurality of servers (Col. 2, hnes 




connected, and which provides a plurality of usable 


32-43 and FIG. 6) 




speech recognition means to the chent, comprising: 






speech input means for inputting speech at the 


(inherent for speech reception of speech 




client 


input signal, 740, FIG. 7 or 331, FIG. 3) 




designation means for designating one of the 


speech controller and switch direct speech 




plurality of usable speech recognition means 


input signal to server stations (Col. 2, lines 






32-43, and elems. 335-6, FIG. 3) 
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and processing means for making the speech 
recognition means designated by said designation means 
recognize speech input from said speech input means. 


recognition on the server (314, FIG. 3) 


2, 16 


The system according to claim 1, wherein said 
designation means comprises: 

detection means for detecting designation 
information used to designate one of the plurality of 
usable speech recognition means from the speech input by 
said speech input means, and 

said designation means designates one of the 
plurality of usable speech recognition means on the basis 
of the designation information detected by said detection 
means. 


Local recognizer parses user's commands 
(334, FIG. 3 and Col. 2, lines 40-43) 

Controller performs selection of the server 
and the routing of speech data to that server 
(335, FIG. 3 and Col. 2, lines 40-43) 


5, 19 


The system according to claim 1, wherein each of said 
plurality of usable speech recognition means is capable of 
converting data format of speech input by said speech 
input means into a speech data format that can be 
recognized by speech recognition processing when the 
data format of the speech input by said speech input 
means is different from the speech data format. 


Speech recognition systems inherently 
convert the format of input speech into a 
different data format used for speech 
recognition (A/D conversion, framing, 
filtering, feature vectors, etc.) (Col. 4, 14- 
28) 


6,20 


A speech recognition system in which a client and a 
plurality of devices that provide a speech recognition 
process are connected, wherein the client comprises: 
speech input means for inputting speech 

designation means for designating one of the 
plurality of speech recognition process, 

wherein said system comprises processing means 
for making a speech recognition process provided by a 
device designated by said designation means, recognize 
the speech input from said speech input means. 


Client + plurality of servers (Gol. 2, lines 
32-43 and FIG. 6) 

(inherent for speech reception of speech 
input signal, 740, FIG. 7 or 331, FIG. 3) 

speech controller and switch direct speech 
input signal to server stations (Col, 2, lines 
32-43, and elems. 335-6, FIG. 3) 

recognition on the server (3 14, FIG. 3) 
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7,21 


The system according to claim 6, wherein said chent 
farther comprises detection means for detecting speech 
corresponding to designation information used to 
designate one of the plurahty of usable speech recognition 
means, from the speech input by said speech input means, 

wherein said designation means converts the 
speech detected by said detection means into designation 
information by using a speech recognition on the client, 
and designates one of the plurahty of said speech 
recognition means on the basis of the converted 
designation information. 


Local recognizer parses user*s commands: 
"Select Philips, (334, FIG. 3 and Col. 2, 
lines 40-43), Speech recognizer is 
inherently built to detect speech. 

Local recognizer/controller performs 
selection of the server and the routing of 
speech data to that server (335, FIG. 3 and 
CoL 2, lines 40-43) 


8,22, 
29,31 


A chent in a speech recognition system to which a device 
that provides a speech recognition process is connected, 
and which provides a plurality of usable speech 
recognition means, comprising: 

speech input means for inputting speech 

and dp'sip^natinti mpans for df'^iffnatinP one of the 
plurality of usable speech recognition means 

and processing means for making the speech 
recognition means designated by said designation means 
recognize speech input from said speech input means. 


Client + plurality of servers (Col. 2, lines 
32-43 and FIG. 6). Client computers 
necessarily contain storage mediums (RAM, 
hard disks, etc.) + software. 

(inherent for speech reception of speech 
input signal, 740, FIG. 7 or 331, FIG. 3) 

speech controller and switch direct speech 
input signal to server-stations (Col. 2, lines 
32-43, and elems. 335-6, FIG. 3) 

recognition on the server (314, FIG. 3) 


9, 23 


The client according to claim 8, wherein said designation 
means comprises: 

detection means for detecting designation 
information used to designate one of the plurality of 
usable speech recognition means from the speech input by 
said speech input means, and 


Local recognizer parses user's commands: 
^^Select Philips, " (334, FIG. 3 and Col. 2, 
lines 40-43). Speech recognizer is 
inherently built to detect speech. 
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said designation means designates one of the 
plurality of usable speech recognition means on the basis 
of the designation information detected by said detection 
means. 


Local recognizer/controller performs 
selection of the server and the routing of 
speech data to that server (335, FIG. 3 and 
Col 2, lines 40-43) 


14,28, 
30, 32 


A server which is connected to a client via a network, and 
provides a plurality of usable speech recognition means to 
the client, comprising: 

providing means for providing a designation 
operation environment that allows the client to designate 
one of the plurality of usable speech recognition means 

and processing means for making the speech 
recognition means designated by said providing means 
recognize speech received via the network. 


(servers necessarily contain storage 
mediums + software (RAM, hard disks, 
etc.)) 

Web page containing advertising banners of 
different companies, having their own 
recognition servers (CoL 2, lines 36-40). 
Either the whole web-page or specific 
banners read on the "providing means for 
providing a designation operation." 

Speech recognizer on a dedicated SR server 
(314, FIG. 3) 



Claim Rejections - 35 use §103 

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

4. Claims 3-4, 10-13, 17-18, 24-27 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Thelen et al. 
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As per claims 3, 11, 13, 17, 25, 27, Thelen et al. do not disclose "GUI input 
means for designating one of the plurality of usable speech recognition means via 
graphical user interface." 

However. Thelen et al. disclose the use of a web page containing advertising 
banners of different companies, each company having its own recognition servers (Col. 
2, lines 36-40), The users select the banner by saying the banner's name, tag name, 
etc. (Col. 2, lines 45-48). The examiner takes the official notice that it is extremely well- 
known in the art to navigate web pages using GUI by clicking on the banner using a 
standard 10 device, such as a computer mouse. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Thelen et al. to choose recognition servers by 
using a standard GUI to select the associated banners (by clicking on them, for 
example), as this is the most well-known and universally accepted method of navigating 
web pages, and would allow the users to browse the "speech recognition" web pages in 
a way identical to browsing the "regular" web pages. 

As per claims 4, 12, 18, 26, Thelen et al. discloses receiving input speech (740, 
FIG. 7) but do not explicitly disclose using "plurality of speech Input means." 

However, the examiner takes the official notice that input systems with multiple 
microphones (such as microphone arrays) are notoriously well-known in the art and are 
used in a variety of systems adapted for noisy or spatially large environments 
(conference rooms, car SR, home entertainment systems, etc.). 
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Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Thelen et al. to use multiple speech input means 
(microphones), as such arrangement is very well-known in the art, and would allow the 
users of Thelen et al.'s system to improve the reception of speech signal in noisy or 
spatially large environments (conference rooms, etc.) 



As per claims 10, 24, Thelen et al. do not disclose a situation "when said 
detection means does not detect any designation information, said processing means 
recognizes the speech input from said speech input means without using the plurality of 
speech recognition means." 

However, Thelen et al. disclose an embodiment where a local recognizer is used 
to identify a "routing command" and direct the speech signal to the appropriate server 
(Col. 9, lines 4-16). In other words, a local recognizer does recognize input speech, but 
if the routing command was not present in the table or was not specified by a user (Col. 
9, lines 5-10), then the client station would not be able to contact the SR server for the 
further speech recognition. Hence, assuming that no default SR server is specified, 
whenever the client cannot resolve the destination server, all the speech processing 
must be either performed directly on the client using a limited speech recognizer (334, 
FIG. 3) or aborted altogether. Since the latter option is not useful, the limited speech 
recognizer on the client is the only viable device that can perform speech recognition in 
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the situation descnbed above or when small commands are used for operation of the 
client's workstation (Col. 8, lines 49-60) 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Thelen et al. to use a local recognizer as a back- 
up option whenever the remote server is not specified in order to provide the user with 
some speech recognition capability instead of aborting the transaction altogether. This 
would improve the overall usability of the system and would work particularly well in 
situations where the small commands are used in operation of the client's workstation 
(Col.8, lines 49-60) 

Conclusion 

5. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

Thelen et al. (6,526,380) teach a system with multiple parallel recognizers. 
White et al. (6,408,272) teach a distributed voice recognition system. 
Joost (6,327,568) teaches a distributed voice recognition system. 

6. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Dmitry Brant whose telephone number is (703) 305- 
8954. The examiner can normally be reached on Mon. - Fri. (8:30am - 5pm). 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Talivaldis Ivars Smits can be reached on (703) 306-301 1 . The fax phone 
number for the organization where this application or proceeding is assigned is (703) 
872-9306. 

Any inquiry of a general nature or relating to the status of this application or 
proceeding should be directed to Tech Center 2600 receptionist whose telephone 
number is (703) 305- 4700. 




